The complexity of properly learning simple concept classes
نویسندگان
چکیده
We consider the complexity of properly learning concept classes, i.e. when the learner must output a hypothesis of the same form as the unknown concept. We present the following new upper and lower bounds on well-known concept classes: • We show that unless NP = RP, there is no polynomial-time PAC learning algorithm for DNF formulas where the hypothesis is an OR-of-thresholds. Note that as special cases, we show that neither DNF nor OR-of-thresholds are properly learnable unless NP = RP. Previous hardness results have required strong restrictions on the size of the output DNF formula. We also prove that it is NP-hard to learn the intersection of ` ≥ 2 halfspaces by the intersection of k halfspaces for any constant k ≥ 0. Previous work held for the case when k = `. • Assuming that NP 6⊆ DTIME(2n2) for a certain constant 2 < 1 we show that it is not possible to learn size s decision trees by size s decision trees for any k ≥ 0. Previous hardness results for learning decision trees held for k ≤ 2. • We present the first non-trivial upper bounds on properly learning DNF formulas. More specifically, we show how to learn size s DNF by DNF in time 2 √ n log . The hardness results for DNF formulas and intersections of halfspaces are obtained via specialized graph products for amplifying the hardness of approximating the chromatic number as well as applying recent work on the hardness of approximate hypergraph coloring. The hardness results for decision trees, as well as the new upper bounds, are obtained by developing a connection between automatizability in proof complexity and learnability, which may have other applications. ∗Supported by CCR grant NCCR-0324906. †Supported by an NSERC Postgraduate Scholarship ‡Supported by NSF grant CCR-98-77049. §Part of this work done at Harvard University and supported by an NSF Mathematical Sciences Postdoctoral Research Fellowship. ¶Supported by NSERC and PREA research grants.
منابع مشابه
Learning Simple Concept Under Simple Distributions
This is a preliminary draft version. The journal version [SIAM. J. Computing, 20:5(1991), 911935] is the correct final version. However, the polynomial time computable universal distribution section in there is too sloppy. For a better treatment see ‘‘M. Li and P.M.B. Vitanyi, An Introduction to Kolmogorov Complexity and its Applications, Springer-Verlag, New York, Second Edition, 1997,’’ Secti...
متن کاملLearning from Partial Observations
We present a general machine learning framework for modelling the phenomenon of missing information in data. We propose a masking process model to capture the stochastic nature of information loss. Learning in this context is employed as a means to recover as much of the missing information as is recoverable. We extend the Probably Approximately Correct semantics to the case of learning from pa...
متن کاملOn the Proper Learning of Axis Parallel Concepts
We study the proper learnability of axis-parallel concept classes in the PAC-learning and exactlearning models. These classes include union of boxes, DNF, decision trees and multivariate polynomials. For constant-dimensional axis-parallel concepts C we show that the following problems have time complexities that are within a polynomial factor of each other. 1. C is α-properly exactly learnable ...
متن کاملTask Complexity Manipulation and Accuracy in Writing Performance
This study aimed to investigate the impact of task sequencing, along +/- reasoning demands dimension, on writing task performance in terms of accuracy. The study was motivated by Robinson’s Cognition Hypothesis (CH) as well as previous studies investigating the relationships between task complexity and second language production. The participants of the study were 90 intermediate students at t...
متن کاملAn Investigation into the Effects of Joint Planning on Complexity, Accuracy, and Fluency across Task Complexity
The current study aimed to examine the effects of strategic planning, online planning, strategic planning and online planning combined (joint planning), and no planning on the complexity, accuracy, and fluency of oral productions in two simple and complex narrative tasks. Eighty advanced EFL learners performed one simple narrative task and a complex narrative task with 20 minutes in between. Th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. Comput. Syst. Sci.
دوره 74 شماره
صفحات -
تاریخ انتشار 2008